Semi-Supervised Support Vector Machines
نویسندگان
چکیده
We introduce a semi-supervised support vector machine (S3yM) method. Given a training set of labeled data and a working set of unlabeled data, S3YM constructs a support vector machine using both the training and working sets. We use S3YM to solve the transduction problem using overall risk minimization (ORM) posed by Yapnik. The transduction problem is to estimate the value of a classification function at the given points in the working set. This contrasts with the standard inductive learning problem of estimating the classification function at all possible values and then using the fixed function to deduce the classes of the working set data. We propose a general S3YM model that minimizes both the misclassification error and the function capacity based on all the available data. We show how the S3YM model for I-norm linear support vector machines can be converted to a mixed-integer program and then solved exactly using integer programming. Results of S3YM and the standard I-norm support vector machine approach are compared on ten data sets. Our computational results support the statistical learning theory results showing that incorporating working data improves generalization when insufficient training information is available. In every case, S3YM either improved or showed no significant difference in generalization compared to the traditional approach. Semi-Supervised Support Vector Machines 369
منابع مشابه
An Inexact Implementation of Smoothing Homotopy Method for Semi-Supervised Support Vector Machines
Semi-supervised Support Vector Machines is an appealing method for using unlabeled data in classification. Smoothing homotopy method is one of feasible method for solving semi-supervised support vector machines. In this paper, an inexact implementation of the smoothing homotopy method is considered. The numerical implementation is based on a truncated smoothing technique. By the new technique, ...
متن کاملSemi-Supervised Support Vector Rainfall Estimation Using Satellite Images
In this paper we introduce the use of semi-supervised support vector machines for rainfall estimation using images obtained from visible and infrared NOAA satellite channels. Two experiments were performed, one involving traditional SVM and other using semi-supervised SVM (SVM). The SVM approach outperforms SVM in our experiments, with can be seen as a good methodology for rainfall satellite es...
متن کاملA Fuzzy Semi-Supervised Support Vector Machines Approach to Hypertext Categorization
Hypertext/text domains are characterized by several tens or hundreds of thousands of features. This represents a challenge for supervised learning algorithms which have to learn accurate classifiers using a small set of available training examples. In this paper, a fuzzy semi-supervised support vector machines (FSS-SVM) algorithm is proposed. It tries to overcome the need for a large labelled t...
متن کاملOptimization Techniques for Semi-Supervised Support Vector Machines
Due to its wide applicability, the problem of semi-supervised classification is attracting increasing attention in machine learning. Semi-Supervised Support Vector Machines (S3VMs) are based on applying the margin maximization principle to both labeled and unlabeled examples. Unlike SVMs, their formulation leads to a non-convex optimization problem. A suite of algorithms have recently been prop...
متن کامل